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Abstract 

Internet search companies sell advertisement slots based on users' search queries via an 
auction. Advertisers have to determine how to place bids on the keywords of their interest in 
order to maximize their return for a given budget: this is the budget optimization problem. The 
solution depends on the distribution of future queries. In this paper, we formulate stochastic 
versions of the budget optimization problem based on natural probabilistic models of distribution 
over future queries, and address two questions that arise. 

Evaluation Given a solution, can we evaluate the expected value of the objective function? 

Optimization Can we find a solution that maximizes the objective function in expectation? 

Our main results are approximation and complexity results for these two problems in our three 
stochastic models. In particular, our algorithmic results show that simple prefix strategies that 
bid on all cheap keywords up to some level are either optimal or good approximations for many 
cases; we show other cases to be NP-hard. 

1 Introduction 

Internet search companies use auctions to sell advertising slots in response to users' search queries. 
To participate in these auctions, an advertiser selects a set of keywords that are relevant or descrip- 
tive of her business, and submits a bid for each of them. Upon seeing a user's query, the search 
company runs an auction among the advertisers who have placed bids for keywords matching the 
query and arranges the winners in slots. The advertiser pays only if a user clicks on her ad. Ad- 
vertiser's bid affects the position of the ad, which in turn affects the number of clicks received and 
the cost incurred. In addition to the bids, the advertiser specifies a daily budget. When the cost 
charged for the clicks reaches the budget, the advertiser's ads stop participating in the auctions. 

In what follows, we first model and abstract the budget optimization problem, and then present 
our stochastic versions, before describing our results. 
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1.1 Advertiser's Budget Optimization problem 

We adopt the viewpoint of an advertiser and study the optimization problem she faces. The 
advertiser has to determine the daily budget, a good set of keywords, and bids for these keywords 
so as to maximize the effectiveness of her campaign. The daily budget and the choice of keywords 
are business-specific, so they are assumed to be given in our problem formulation. Effectiveness of 
a campaign is difficult to quantify since clicks resulting from some keywords may be more desirable 
than others, and in some cases, just appearing on the results page for a user's query may have some 
utility. For most of the paper, we adopt a common measure of the effectiveness of a campaign, 
namely, the number of clicks obtained^- Further, seen from an individual advertiser's point of view, 
the budgets and bids of other advertisers are fixed for the day. We develop most of our discussions 
assuming that each keyword has a single winning bid amount, which is fixed throughout the day 
and known in advance. This models the case of an auction with a single slot, and disregards the 
possibility of other advertisers changing their bids or running out of budget. We mention the 
extensions that eliminate some of these assumptions in Section IA.7I 

Under our assumptions, each keyword i has a single threshold bid amount, such that any bid 
below this amount loses the auction and does not get any clicks. Any bid above the threshold wins 
the auction, and gets clicks with cost per click equal to the threshold bid amount^. In this case 
the advertiser's decision for each keyword becomes binary: whether or not to bid on it above its 
threshold. We use decision variables b, L , which can be integral (6j € {0, 1}) or fractional (6j G [0, 1]), 
to indicate whether or not there is a bid on keyword i. A fractional bid represents bidding for b{ 
fraction of the queries that correspond to keyword i, or equivalently bids on each such query with 
probability 6j. Integer bid solutions are slightly simpler to implement than fractional bids and are 
more desirable when they exist. 

Finally, consider the effect of user behavior on the advertiser. We abstract it using the function 
clicksj, which is the number of clicks the advertiser gets for queries corresponding to keyword i. 
Each such click entails a cost cpc^, which is assumed to be known. Now, the advertiser is budget- 
constrained, and some solutions may run out of budget, which decreases the total number of clicks 
obtained. In particular, the advertiser has a global daily budget B, which is used to get clicks for 
all of the keywords. When the budget is spent, the ads stop being shown, and no more clicks can be 
bought. We model the limited budget as follows. Consider a solution b that bids on some keywords. 
If the budget were unlimited, then bidding on those keywords would bring clicks(6) clicks, which 
together would cost cost(b). But when the budget B is smaller than cost(fo), this solution runs 
out of money before the end of the day, and misses the clicks that come after that point. If we 
assume that the queries and clicks for all keywords are distributed uniformly throughout the day 
and are well-mixed, then this solution reaches the budget after B / 'cost (6) fraction of the day passes, 
missing (1 — £?/cost(6)) fraction of the possible clicks for each keyword. As a result, the number 

of clicks collected before the budget is exceeded is i n expectation. 

Based on the discussion so far, we can now state the optimization problem an advertiser faces. 

Definition 1 Budget Optimization Problem (BO). An advertiser has a set T of keywords, 
with \T\ = n, and a budget B. For each keyword i £ T , we are given clicks^, the number of clicks 



1 We extend our results to a more general model where clicks for different keywords may have different values in 
Section [6] 

2 This assumes second-price auctions, where the winner's cost is the highest bid of others. All our results also 
apply to weighted second-price auctions, which are common in search-based advertising. 
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that correspond to i, and cpc i , the cost per click of these clicks. We define costj = cpCj • clicks.;. 
The objective is to find a solution b = (b%, b n ) with a bid < bi < 1 for each i € T to maximize 

value(6) = flicks, (1) 

max (1, }2ieT kcosti/B) 

The numerator of the objective function is the number of clicks available to b, and the denomi- 
nator scales it down in the case that the budget is exceeded. If we define clicks(b) = YlieT frjdicksj, 

cost(b) = ^2 ieT biCosti, and the average cost per click of solution b as cpc(fo) = , then 

CLICKS! C/ ) 

f clicks (b) if cost (b) < B 
value (6) = I K ' y ' ~ (2 

\B/cpc(b) if cost (6) > B 

So maximizing value(6) is equivalent to maximizing the number of clicks in case that we are under 
budget, and minimizing the average cost per click if we are over budget. We always assume that 
the keywords are numbered in the order of non-decreasing cpc^, i.e. cpc x < cpc 2 < • • • < cpc n . 



1.2 Stochastic versions 

Many variables affect the number of clicks that an advertiser receives in a day. Besides the ad- 
vertiser's choice of her own budget and keywords which we take to be given, and the choices of 
other advertisers which remain fixed, the main variable in our problem is the number of queries of 
relevance that users issue on that day, and the frequency with which the ads are clicked o These 
quantities are not known precisely in advance. Our premise is that Internet search companies can 
analyze past data and provide probability distributions for parameters of interest. They currently 
do provide limited amount of information about the range of values taken by these parametersO 
This motivates us to study the problem in the stochastic setting where the goal is to maximize the 
expected value of the objective under such probability distributions. 

In the stochastic versions of our problem, the set of keywords T, the budget B, and the cost per 
click cpCj for each keyword are fixed and given, just like in the BO problem of Definition [TJ What 
is different is that the numbers of clicks clicks^ corresponding to different keywords are random 
variables having some joint probability distribution. But because general joint probability distri- 
butions are difficult to represent and to work with, we formulate the following natural stochastic 
models. (In contrast, the problem where clicks j are known precisely for all i is called the fixed 
model from here on.) 

Proportional Model The relative proportions of clicks for different keywords remain constant. 
This is modeled by one global random variable for the total number of clicks in the day, and 
a fixed known multiplier for each keyword that represents that keyword's share of the clicks. 

Independent Keywords Model Each keyword comes with its own probability distribution for 
the number of clicks, and the samples are drawn from these distributions independently. 

3 The nature and number of queries vary significantly. An example in Google Trends shows the spikes in searches 
for shoes, flowers and chocolate: http : //www. google . com/trends?q=shoes , flowers , chocolate 

4 See for example the information provided to any AdWords advertiser. See also 

s : // adwords . google . com/support 
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Scenario Model There is an explicit list of ./V scenarios. Each scenario specifies the number of 
clicks for each keyword, and has a probability of occurring. We think of N as reasonably 
small, and allow the running time of algorithms to depend (polynomially) on N. 

The scenario model is important for two reasons. For one, market analysts often think of un- 
certainty by explicitly creating a set of a few model scenarios, possibly attaching a weight to each 
scenario. The second reason is that the scenario model gives us an important segue into under- 
standing the fully general problem with arbitrary joint distributions. Allowing the full generality 
of an arbitrary joint distribution gives us significant modeling power, but poses challenges to the 
algorithm designer. Since a naive explicit representation of the joint distribution requires space 
exponential in the number of random variables, one often represents the distribution implicitly by 
a sampling oracle. A common technique, Sampled Average Approximation (SAA), is to replace the 
true distribution T> by a uniform or non-uniform distribution T> over a set of samples drawn by some 
process from the sampling oracle, effectively reducing the problem to the scenario model. For some 
classes of problems, see e.g. [131 [20], it is known that SAA approximates the original distribution 
to within an arbitrarily small error using polynomially many samples. While we are not aware of 
such bounds applicable to the budget optimization problem, understanding the scenario model is 
still an important step in understanding the general problem. 

There are two issues that arise in each of the three stochastic models. 

• Stochastic Evaluation Problem (SE). Given a solution b, can we evaluate i? [value (6) ] 
for the three models above? Even this is nontrivial as is typical in stochastic optimization 
problems. It is also of interest in solving the budget optimization problem below. 

• Stochastic Budget Optimization Problem (SBO). This is the Budget Optimization 
problem with one of the stochastic models above determining clicks j for each i, with the 
objective to maximize 



The expectation is taken over the joint distribution of clicksj for all i € T. 
1.3 Our results 

We present algorithmic and complexity-theoretic results for the SE and SBO problems. 

For SE problems, our results are as follows. The problem is straight-forward to solve for the fixed 
and scenario models since the expression for the expected value of the objective can be explicitly 
written in polynomial time. For the proportional model, we give an exact algorithm to evaluate 
a solution, assuming that some elementary quantities (such as probability of a range of values) 
can be extracted from the given probability distribution in polynomial time. For the independent 
model, the number of possibilities for different click quantities may be exponential in the number 
of keywords, and the problem of evaluating a solution is likely to be #P-hard. We give a PTAS 
for this case. These evaluation results are used to derive algorithms for the SBO problem, though 
they may be of independent interest. 

Our main results are for the SBO problem. In fact, all our algorithms produce a special kind of 
solutions called prefix solutions. A prefix solution bids on some prefix of the list of keywords sorted 




Y,j£T ^clicks.; 



(3) 
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in the increasing order of cost per click (cpcj, i.e., on the cheap ones. Formally, an integer prefix 
solution with bids b{ has the property that there exists some i* such that fej = 1 for all i < i* , and 
bi = for i > i* . For a fractional prefix solution, there exists an i* such that hi = 1 for i < i* , 
&i = for i > i*, and bi* E [0, 1]. We show: 

• For the proportional model, we can find an optimal fractional solution in polynomial time if 
the distribution of clicks can be described using polynomial number of points; else, we obtain 
a PTAS. We get this result by showing that the optimal fractional solution in this case is a 
prefix solution and giving an algorithm to find the best prefix. 

• Our main technical contribution is the result for the independent model, where we prove that 
every integer solution can be transformed to a prefix solution by removing a set of expensive 
keywords and adding a set of cheap ones, while losing at most half of the value of the solution. 
Thus, some integer prefix is always a 2-approximate integer solution. When combined with 
our PTAS for the evaluation problem, this leads to a 2 + e approximation algorithm. We also 
show that the best fractional prefix is not in general the optimal fractional solution in this 
case. 

• For the scenario model, we show a negative result that finding the optimum, fractional or 
integer, is NP-hard. In this case, the best prefix solution is arbitrarily far from the optimum. 

1.4 Related work 

Together, our results represent a new theoretical study of stochastic versions of budget optimization 
problems in search-related advertising. The budget optimization problem was studied recently [9] 
in the fixed model, when clicks j's are known. On one hand, our study is more general, with the 
emphasis on the uncertainty in modeling clicks j's and the stochastic models we have formulated. 
We do not know of prior work in this area that formulates and uses our stochastic models. On the 
other hand, our study is less general as it does not consider the interaction between keywords that 
occurs when a user's search query matches two or more keywords, which is studied in [9]. 

Stochastic versions of many optimization problems have been considered, such as facility lo- 
cation, Steiner trees, bin-packing and LP (see, for example, the survey [20J). Perhaps the most 
relevant to our setting is the work on the stochastic knapsack problem, of which several versions 
have been studied. Dean et al. [8] consider a version of the problem in which item values are fixed, 
and item sizes are independent random variables. The realization of an item's size becomes known 
once it is placed in the knapsack, so an algorithm has to select items one at a time, until the 
knapsack capacity is exceeded. In [12] and [TO], a version of the problem with fixed item values 
and random sizes is considered as well, but there the goal is to choose a valuable set of items whose 
probability of exceeding the knapsack capacity is small. Other authors [3 [Til CEH1 IH] have studied 
versions with fixed item sizes but random values. If viewed as a version of stochastic knapsack, 
our problem is different from all of these in several respects. First, there is no hard capacity con- 
straint, but instead the objective function decreases continuously if the cost of the keywords (which 
is analogous to the size of the items) exceeds the budget (the analog of the knapsack capacity). 
The second difference is that in our model, both the number of clicks and the cost of the keywords 
(i.e. item values and sizes, respectively) are random, but their ratio for each particular keyword 
(item) is fixed and known. Another difference is that previous work on stochastic knapsack con- 
siders independent distributions of item parameters, whereas two of our models (proportional and 
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scenario) have correlated variables. Furthermore, although the greedy algorithm which takes items 
in the order of their value-to-size ratio is well-known and variations of it have been applied to 
knapsack-like problems, our analysis proving the 2-approximation result is new. 

Recently, Chakrabarty et al. [6] considered an online knapsack problem with the assumption of 
small element sizes, and Babaioff et al. [2] considered an online knapsack problem with a random 
order of element arrival, both motivated by bidding in advertising auctions. The difference with our 
work is that these authors consider the problem in the online algorithms framework, and analyze the 
competitive ratios of the obtained algorithms. In contrast, our algorithms make decisions offline, 
and we analyze the obtained approximation ratios for the expected value of the objective. Also, our 
algorithms base their decisions on the probability distributions of the clicks, whereas the authors 
of [2 J and [6] do not assume any advance knowledge of these distributions. The two approaches are 
in some sense complementary: online algorithms have the disadvantage that in practice it may not 
be possible to make new decisions about bidding every time that a query arrives, and stochastic 
optimization has the disadvantage of requiring the knowledge of the probability distributions. 

Also motivated by advertising in search-based auctions, Rusmevichientong and Williamson |17j 
have studied the keyword selection problem, where the goal is to select a subset of keywords from a 
large pool for the advertiser to choose to bid. Their model is similar to our proportional model, but 
the proportions of clicks for different keywords are unknown. An adaptive algorithm is developed 
that learns the proportions by bidding on different prefix solutions, and eventually converges to 
near-optimal profits [T7], assuming that various parameters are concentrated around their means. 
The difference with our work is that we consider algorithms that solve the problem in advance, and 
not by adaptive learning, and work for any arbitrary (but pre-specified) probability distributions. 

There has been a lot of other work on search-related auctions in the presence of budgets, but it 
has primarily focused on the game-theoretic aspects [EJ H] , strategy-proof mechanisms [U [3] , and 
revenue maximization |15|. I14j . 

1.5 Map 

We briefly discuss the fixed case first, and then focus on the three stochastic models in the following 
sections; in each case, we solve both evaluation and BO problems. Finally, we present some 
extensions of our work and state a few open problems. Some of the proofs appear in the Appendix. 

2 Fixed model 

For the BO problem in the fixed model, a certain fractional prefix, which is easy to find, is the 
optimal solution. The algorithm is analogous to that for the fractional knapsack problem. We find 
the maximum index i* such that Yli<i* cos ^i < B. If i* is the last index in T, we set bi = 1 for all 
keywords i. Otherwise find a fraction a € [0, 1) such that Yli<i* cos ^i + ct ■ costj* + i = B, and set 
bi = 1 for i < i*, bi* + \ = a, and bi = for i > i* + 1. 

Theorem 2 In the fixed model, the optimal fractional solution for the BO problem is the maximal 
prefix whose cost does not exceed the budget, which can be found in linear time. 

The integer version of this problem is NP-hard by reduction from KNAPSACK. 
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3 Proportional model 

In the proportional model of SBO, we are given q^, the click frequency for each keyword i G T, 
with X^igr 9* = 1- ^he total number of clicks is denoted by a random variable C, and has a 
known probability distribution p. The number of clicks for a keyword i is then determined as 
clicksj = qi ■ C. For a specific value c of C, let clicks? = qiC and cost? = cpc^clicks?. The 
objective is to maximize the expected number of clicks, given by expression 

Theorem 3 The optimal fractional solution for the SBO problem in the proportional model is a 
fractional prefix solution. 

The proof is by an interchange argument and appears in Section [AJ] We now show how to solve 
the evaluation problem efficiently in the proportional model, and then use it to find the best prefix, 
which by Theorem [3] is the optimal fractional solution to SBO. 

3.1 Evaluating a solution 

Assuming that the distribution for C is given in such a way that it is easy to evaluate Pr[C > c*] 
and Y^c<c* c ' p( c ) f° r an y c *> we sriow how to find £7[value(6)] for any given solution b without 
explicitly going through all possible values of C and evaluating the objective function for each one. 

The solution b may be under or over budget depending on the value of C. Define a threshold 
c* = B j YlieT &t9i c P c i> so that for c < c*, cost c (fo) < B, and for c > c*, cost c (6) > B. Notice 
that in the proportional model, cpc(6) is independent of C, as both clicks(b) and cost (6) are 
proportional to C. Then using expression ((2J) for value(b), the objective becomes easy to evaluate: 



3.2 Finding the optimal prefix 

It is nontrivial to find the best fractional prefix solution for the proportional case, and we mention 
two approaches that do not work. One simple way to find a prefix in the proportional model is to 
convert it to a fixed case problem by setting the number of clicks for a keyword to its expectation. 
This approach fails on an example with two keywords and two possible values of C: a likely value of 
zero and a small-probability large value. Another approach is some greedy procedure that lengthens 
the prefix while the solution improves. This does not work either, because the expected value of 
the solution as a function of the length of the prefix can have multiple local maxima. 

The best prefix can be found by producing a list of 0(n + 1) prefixes (out of uncountably many 
possible ones) containing the optimum, as explained in Section [A.2[ Here t is the number of possible 
values of C. If t is not polynomial in n, the probability that C falls between successive powers of 
(1 + e) can be combined into buckets, yielding a PTAS for the problem. 

Theorem 4 The optimal fractional solution to SBO problem in the proportional model can be 
found exactly in time 0(n + t), where t is the number of possible values of C, or approximated by 




a PTAS. 
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4 Independent model 



In the independent model of SBO, the number of clicks for keyword i£T, clicks^, has a probability 
distribution pi (which can be different for different keywords). The key distinguishing feature of 
this model is that for i ^ j, the variables clicks^ and clicks^ are independent. This model is 
more complex than the ones discussed so far. A three-keyword example in Section [A. 31 shows the 
following. 

Theorem 5 In the independent model of the SBO problem, the optimal fractional solution may 
not be a prefix solution. 

However, in Section 14.11 we prove that some integer prefix solution is a 2-approximate integer 
solution. But finding the best integer prefix requires the ability to evaluate a given solution, which 
in this model is likely to be ^P-hard. We develop a PTAS, based on dynamic programming, for 
evaluating a solution (the algorithm is presented in Section IA.4p . Combined, these two results 
imply a (2 + e)-approximation for the SBO problem in the independent model. 



4.1 Prefix is a 2-approximation 

In this section we show that for any instance of the SBO problem in the independent model, there 
exists an integer prefix solution whose expected value is at least half that of the optimal integer 
solution. In particular, any integer solution b can be transformed into a prefix solution by without 
losing more than half of its value. Let S = {i \ bi = 1} be the set of keywords that b bids on. 

We make some definitions that allow us to specify the prefix solution by precisely in Theorem 
El Let a be the event that clicks for each keyword i € T come in quantity clicks CT (f). Then 
its probability is p(o~) = Yl ieT Pi (clicks' 7 (i)). Define the number of clicks available to solution b 
in the event a as clicks CT (6) = ^ igS clicks' 7 (i), the corresponding cost per keyword cost""(i) = 
cpCj • clicks' 7 (i), and total cost cost cr (b) = X^ie5 cos t CT (&)- The effective number of clicks (after 
taking the budget into account) that solution b gets from keyword i in the event a is 

clicks' 7 (i) 



clicks^i) 



max(l,cost' 7 (b)/ J B)' 



and the total effective number of clicks is clicks (6) = ^2 ieS clicks 5 (i). Then the objective becomes 
the sum of effective number of clicks in each scenario, weighted by that scenario's probability: 
£7[value(6)] = £ ff p^cHcks^fj). 

Let i*(b) be the minimum index i* such that keywords up to i* account for half the clicks: 



J>(<7) clicks^*) > I £[value(b)]. 



Theorem 6 For any integer solution b to the SBO problem with independent keywords, there exists 
an integer prefix solution by such that Rvalue (fry)] > ^ E[value(b)] . In particular, the solution 
by bidding on the set V = {i \ i < i*(b)} has this property. 

The idea of the proof is to think of the above prefix solution as being obtained in two steps from 
the original solution b. First, we truncate b by discarding all keywords after i*. Then we fill in the 
gaps in the resulting solution in order to make it into a prefix. To analyze the result, we first show 
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that all keywords up to i* are relatively cheap, and that the truncated solution (called bjj) retains 
at least half the value of the original one (Claim [7]). Then we show that filling in the gaps preserves 
this guarantee. Intuitively, two good things may happen: either clicks for the new keywords don't 
come, in which case we get all the clicks we had before; or they come in large quantity, spending 
the budget, which is good because they are cheap. Lemma [8] analyzes what happens if new clicks 
spend a a fraction of the budget. 

Let i* = i*(b). To analyze our proposed prefix solution by, we break the set V into two disjoint 
sets U and N. U = V D S = {i < i* \ i £ S} is the set of cheapest keywords that get half the clicks 
of b. The new set N = V \ S = {i < i* \ i $l S} fills in the gaps in U. Let bjj and b^ be the 
solutions that bid on keywords in U and N respectively. 

Define the average cost per click of solution b as 



cpc * = EgP(^) Sieg c P c i clicksg(i) 
E ct pO) clicks' 7 (6) 

where the numerator is the average amount of money spent by fe, and the denominator is the average 
number of clicks obtained. A useful fact to notice is that since the numerator of this expression 
never exceeds the budget, and the denominator is equal to 2i7 [value (&)], we have that 

£[value(6)] < JL. (5) 

We make two observations about bjj and i*. 
Claim 7 E [value (bu)] > \ E [value (b)] and cpCj, < 2cpc*. 

Proof. A keyword in U does not necessarily contribute exactly the same number of clicks when 
it appears as part of solution U as it does when it is part of the solution S. However, in U it 
contributes at least as much. Formally, observe that since U C S, for all a, cost a {bu) < cost°~(b), 
which implies that clicksu(i) > clicks s {i) for any i £ U. So 

E [value (bu)] = ^p(er) ^ clicks u (i) > ^ p (a) ^ clicks ^(i) > - E[vahie(b)] 

by definitions of U and i*. 

The second part follows by Markov's inequality: 

* . Y.aP( a ) Eies, 4 >i* C P C ** clicks^ (t) l 

cpc > j > CpCj, • -, 

Y,aP( a ) clicks (b) 2 
where the second inequality is by minimality of i*. □ 

Now we state the main lemma. 

Lemma 8 For any a, leta a = mm(B, cost " (b^))/B. Then 

B 



clicks (M > a u + (1 - d 7 ) clicks (bu). 

2cpc* 
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Proof. The idea here is that a a is the fraction of the budget spent by the new keywords (ones 
from set N) in the event a. So (1 — a a ) fraction of the budget can be used to buy (1 — a a ) fraction 
of clicks that bjj was getting, and a a fraction is spent on keywords (whether from U or N) that 
cost at most 2cpc*. A more formal analysis follows. 
If cost CT (6jv) > B, then cost a (by) > B, so 

£ ie y clicks^) £ clicks<J (*) B ^ B 



clicks" (by) = ^ v - w w = B ■ ^ l€V t^tftt > — — > 

^ v / — ~j-ct f i~ \ r> ~l : ~l — (7 ( - \ — — 



cost°"(6y)/-B Siev cpCjClicks°"(i) cpCj, 2cpc* ' 

which proves the lemma for the case of a a = 1. Intuitively, in this case the whole budget is spent, 
and since all keywords in V cost at most 2cpc* (by Claim [7]), V gets at least 2c pc* cnc ks- For the 

rest of the proof assume that cost CT (bjv) < B. Then a a = cost < \ 

Another simple case is cost CT (by) < B. Then the budget is not reached and V collects all the 
clicks from U and N: 

clicki a (6y) = clicks CT (fo iV )+clicks ff (6 C 7) > costCT ( b ^) + ^ncki CT (b^) > -i^ + (l- a CT ) cTicki° \b v ). 

Now consider the case when cost°"(&Ar) +cost <J (6(/) > B. Here at most a a fraction of the budget is 
used for the new keywords from N, which cost at most 2cpc* per click, and the remaining budget 
is able to buy (1 — a a ) fraction of the clicks that bjj was getting. 
Define cpc a (bjj) = c ° s ^ i^ 7 ^ > an d similarly for N. Then 

B clicks' 7 (b L 



cpc CT (pu) cosf 7 {bu)/B 



> clicks (pu). (6) 



Now, clicks (b 



V 



cost CT (6 



u 



B 



> 



cost CT (by) cpc CT (b[/) 
(1 - a)B rcost CT (6 
cpc a (bu) 
(l-a)B 
cpc a (bu) 



+ 



+ 



+ 



u 



cost a (b N ) B 
cost CT (by) cpc a (b N ) 
B 



cost a (by) 

COSt CT (&[/) 
COSt CT (6y) 



a) 



(1 
(1 - a 



cpc a (bu] 
B 



+ 



cost CT (6 



N 



B 



+ 



cost" (by) 
cost°"(bjv) 



cpc CT (b 



N 



B 



2cpc* cost cr (6y) 2cpc* 



> (1 — a)clicks (bjj) + a 



B 



2cpc* 



where the first inequality follows because cpc°"(6[/) < 2cpc*, cpc CT (6jv) < 2cpc*, and the quantity 
in square brackets is non- negative. The second inequality follows from ([6]). □ 

Proof of Theorem [6], We now use the above results to prove the theorem. Let ajj be the event 
that clicks for each keyword i G U come in quantity clicks' 717 (i), whose probability is p(&u) = 
rj iet7 Pi(clicks' 7c/ (i)). Here the independence of keywords becomes crucial. In particular, what we 
need is that the number of clicks that come for keywords in U is independent of the number of 
clicks for keywords in N. So the probability of ay is the product of p(a\j) and p(aj^), where ay is 
the event that both ajj and gn happen. Notice that a a of Lemma [8] depends only on keywords in 
N, and is independent of what happens with keywords in U. So here we call it a CTjv . We have 
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£[value(& v )] 



} j p(cry)clicks <JV (b v ) 



a' 



ffN B 



+ {l-a aN ) clicks v (bu) 



> 



2cpc 



> 




-£[value(b)], 



bounding both 2c « and E [value (bjj)] by ^_E7[value(&)] using inequality ([S} and Claim [71 



□ 



Theorem [6] combined with PTAS for the evaluation problem in the independent model (which is 
presented in Section [A3] of the Appendix) gives a simple algorithm for finding a (2 + e)-approximate 
solution: evaluate each integer prefix using the PTAS and output the one with maximum value. 

Theorem 9 There is a (2 + e)- approximation algorithm for the SBO problem in the independent 
model, which runs in time polynomial inn, -, and logM ; where M is the maximum possible cost 
of all clicks. 

5 Scenario model 

In the scenario model, we are given T, B and costs cpc^ as usual. The numbers of clicks are 
determined by a set of scenarios £ and a probability distribution p over it, so that a scenario ueS 
materializes with probability p(o~), in which case each keyword i gets clicks^ clicks. The scenarios 
are disjoint and ^2 a£ Y:P( (J ) = 1- The reason this model does not capture the full generality of 
arbitrary distributions is that we assume that the number of scenarios, |S|, is relatively small, 
in the sense that algorithms are allowed to run in time polynomial in |S|. On the other hand, 
if, for example, we express the independent model in terms of scenarios, their number would be 
exponential in the number of keywords. 

The evaluation of a given solution in the scenario model does not present a problem, as it can be 
done explicitly in time polynomial in |E[, by evaluating each scenario and taking the expectation. 
Nevertheless, this is the most difficult model for SBO that we consider. We show two negative 
results. 

Theorem 10 The SBO problem is NP-hard in the scenario model. 

Theorem 11 The gap between the optimal fractional prefix solution and the optimal (integer or 
fractional) solution to the SBO problem in the scenario model can be arbitrarily large. 

The proof of Theorem 1101 shows . by reduction from clique, that it is NP-hard to find either an 
integer or a fractional solution to this problem. Proofs of the two theorems appear in Sections IA.5I 
and lA.6l respectively. 
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6 Extension to click values 



Here we show that our results easily generalize to the case when the clicks from different keywords 
have different values to the advertiser. For example, a weight associated with a keyword might 
represent the probability that a user clicking on the ad for that keyword will make a purchase. 

For each keyword i, we are given a weight Wi which is the value of a click associated with this 
keyword, and we would like to maximize the weighted number of clicks obtained: 



£[value(6)] = E 



max (l, J2i^T frjCpCjClickSj/i?) 



Obviously, the keywords with Wi = can be just discarded. Now we make a substitution of 
variables, defining clicks^ = t^jclicksj and cpc^ = cpcjwi. Substituting them into the objective 
function, 



£[value(6)] = E 



J2ier ^clicks • 



max (l, Y,ieT ^cpc^clicks-/S) 



we see that the problem reduces to the original unweighted SBO instance, with different keyword 
parameters. The proportional, independent, and scenario models of click arrival maintain their 
properties under this transformation, only some of the distributions for the numbers of clicks have 
to be scaled. 



7 Concluding Remarks 

We have initiated the study of stochastic version of budget optimization. We obtained approxi- 
mation results via prefix bids and showed hardness results for other cases. A lot remains to be 
done, both technically and conceptually. Technically, we need to extend the results to the case 
when there are interactions between keywords, that is, two or more of them apply to a user query 
and some resolution is needed. Also, we need to study online algorithms, including online budget 
optimization. Further, we would like to obtain some positive approximation results for the scenario 
model, which seems quite intriguing from an application point of view. The conceptual challenge 
is one of modeling. Are there other suitable stochastic models for search-related advertising, that 
are both expressive, physically realistic and computationally feasible? 
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A Appendix 



A.l Proof of Theorem [3] 

We use an interchange argument to show that any solution can be transformed into a prefix solution 
without decreasing its value. Consider a solution b. If b is not a prefix solution, then there exist 
keywords i and j with i < j, bi < 1, and bj > 0. Choose the smallest such i and the largest such j. 
If (ftcpCj = 0, set bi = 1 and continue. Otherwise pick the maximum 6i,5j > that satisfy 

qjCpc • 

Si < 1 - bi, Sj < bj, 5i = Sj. 

Qicpci 

If we assign b\ = bi + 5i, b 1 - = bj — 5j, and b' k = b^ for k ^ {i, j}, then we get a solution b' such that 
for any c, cost c (6') = cost c (6) and clicks c (6') > clicks c (b): 

cost c (6') — cost c (b) = c • (c/jcpcjjj — qjCpCjSj) = 

cpc 

clicks c (b ) - clicks c (6) = c • (qidi - qj6j) = c ■ qj( - — l)Sj > 0. 

CpCj 



Since for any c, the value of the solution does not decrease, the expected value over C does not 
decrease either, i£ [value (&')] > £'[value(b)]. As a result of the transformation, either b\ = 1 or 
bj = 0, so the process terminates after a finite number of steps, resulting in a prefix solution with 
expected value at least that of the original one. 

A. 2 Proof of Theorem [4] 

We mark some points in the space of possible prefixes. First, we mark all the integer prefixes. 
Then, for each value c of C that has non-zero probability, we mark the threshold prefix b that 
exactly spends the budget for C = c, i.e. such that cost c (b) = B. This partitions the space 
of prefixes into intervals. Notice that for any two prefix solutions b and b' inside of the same 
interval /, the set of values of C that cause these solutions to exceed the budget is the same, i.e. 
{c | cost c (b) > B} = {c | cost c (6') > B}. Call this set Cf. 

Now we show how to find the optimal prefix solution inside an interval defined by the marked 
points. Consider such an interval /, and suppose that all prefix solutions inside I bid bj = 1 for 
j < i, bj = for j > i, and bi € (b\, 62) for some < b\ < b% < 1. Then the objective function for 
solutions in this interval becomes (analogously to equation Q) 

E c ^) IE © + b A + B ^ c e c?i • v g^±^ — , 

cfe> / £ K ^CpC J +M,C P C i 

which we have to maximize over the possible values of the variable bi. This can be done by taking 
the derivative of this expression with respect to bi and setting it to zero, which has at most one 
solution on the interval (61, 62)- If this solution exists, we add it to a set of interesting points. To 
obtain the overall optimum, we evaluate all prefixes defined by the marked and interesting points. 
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A. 3 Proof of Theorem [5] 



We give an example with three keywords in which the optimal solution bids on the first and third 
keywords, and gets more clicks than any (even fractional) prefix solution. The idea is that the 
second and third keywords cost about the same, but the third one is better because it always comes 
in the same quantity, whereas the second one has high variance. Let cpc 1 = 0, cpc 2 = 1, cpc 3 = 1; 
clicksi = 1 with probability 1, clicks2 = (0 or 1) with probability \ each, and clicks3 = 1 with 
probability 1; B = 1. The optimal solution is b\ = 63 = 1 and 62 = 0, which always gets 2 clicks. 
The best prefix solution is b\ = 62 = ^3 = 1 ; which gets 2 or 1.5 clicks with probability ^ each, or 
only 1.75 clicks in expectation. The example can be modified so that the third keyword is strictly 
more expensive than the second one. 

A. 4 Evaluating a solution in the independent model 

In this section we present a PTAS for the SE problem in the independent model. We are given 
an instance of the SBO problem, and an (integer or fractional) solution b. For a keyword i £ T, 
let Ci = {c I Pi(c) > 0} be the set of values that clicks^ can take. For now we assume that 
\Ci\ is polynomial in the size of the input, and later show how to remove this assumption. Let 
cost(&_j) = Ylj^i frjCpCj clicks j be the cost of clicks for all keywords except i. By some algebraic 
manipulation, one can show the following. 

Claim 12 

£[value(6)] = £ £ Pi (c) he £ — i— Pr[cost(6_ i ) = d], 
where fi(c,d) = max(l, ^±£_EE£i), 

In this expression, he is the number of clicks from keyword i, and the expression in the third sum is 
the amount by which this number should be scaled because of the budget. The variable d represents 
the cost of all keywords other than i. 

As a result, the problem of finding £'[value(6)] reduces to evaluating, for any given i and c, the 
expression 

8 & c ) = E tAa Pr[cost(6_. J ) = d\. (7) 

d>o Mc ' a) 

Lemma 13 For any given e > 0, there is a polynomial-time algorithm that finds a value s' such 
that s(i,c) < s' < (1 + e) s(i,c). 

Proof. We build a dynamic programming table that represents an estimate of Pr[cost(b_j) = d] 
as a function of d. Fix an ordering of elements in T — {i} and construct a table P indexed by j 
and d, where P(j, d) is the probability that the total cost of the first j elements is d. To make sure 
the table is of polynomial size, scale the costs so that the minimum non-zero value of costj for any 
j is 1, and restrict the possible values of d to and (1 + ^) fc for non-negative integers k. If we let 
M = X^igT max { c ■ c P c i I c ^ Ci} be the maximum possible cost of all the clicks, then the number 
of values of d in the table is at most log 1+£/ / n M = 0(j logM), which is polynomial in the size of 
the input. 

The table is initialized with P(0, 0) = 1 and other entries equal to zero. Then for each keyword 
j G T — {i}, each possible number of clicks c 6 Cj, and each entry P(J — l,d) in the previous 
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row, we update P(j, [d + c • cpcj\) = P(j, [d + c • cpcj\) +pj(c) ■ P(j — l,d). Here the operator 
J represents rounding down to the next available value of d. After filling the table, the algorithm 
outputs the value of expression © as determined by probabilities in the last row of the table. 

To bound the error incurred by rounding down the costs, we consider an event a that specifies 
a number of clicks Cj G Cj for each j £ T — {i}, and has probability p(cr) = YljPji c j)- Expression 
([7]) can be rewritten as 

S(Z ' C) = ? P(fT) '/,(c,cosV( M ) (8) 

As a result of a series of updates, the probability of a contributes to some entry of the last 
row of P, say to the one with d a = (1 + ^) k . This d a is an estimate of the value of cost CT (6_j) = 
^Zj^ibjCpCjCj. Since we only rounded down, we have d a < cost°"(6_i). Now note that since 
the intervals between successive powers of (1 + §) are increasing, the biggest amount that we 
could have lost during any one rounding is (1 + f ) fc+1 - (1 + § ) k = f (1 + f ) k = § • d a . Since 
we performed the rounding during at most n updates relevant to a, we have that the true value 
cost°"(6_j) < d a + n ■ — ■ d a = (1 + e)d a . So we have that the estimated cost d a for the event a is 

cost 1 7 £ b -' ) < d° < cost CT (6_ 4 )- 

Now the only thing left to do in order to show that the algorithm evaluates expression ([7]) 
accurately is to take into account /j(c, d). By monotonicity of fi, we have 

/. U C ° S ^ b - l) ] < Med") < /,(c,cost CT (b^)). 



But notice that 



So we have that 



and therefore 



cost CT (6_ 4 )\ ^ / 4 (c,cost CT (ft_i)) 

fi { C ' ~ ) ~ I+E • 



1 + e 
fi(c, cost CT (6_i)) 



1 



Hc,d-: 



l + e 



< fi(c,d°) < / i (c,cost (7 (6_ i )) 
1 (l+e) 



/i(c, cost CT (&_i)) /i(c, COSt CT (b_i)) 

So evaluating expression ([8]) using entries from the dynamic programming table instead of the true 
costs and probabilities incurs a multiplicative error of at most (l + e). □ 



If the input distributions pi are represented implicitly, such that ^ i \Ci\ is not polynomial in 
the input size, then we first convert them into distributions with polynomial number of points by 
combining the probability mass between successive powers of (l + e') into buckets (rounding down). 
Then we run the above algorithm for discrete distributions so as to obtain a (1 + e')-approximation 
for the rounded instance. This will be a (1 + e') 2 -approximation for the original instance, so if e' is 
chosen such that (1 + e') 2 < (1 + e), we obtain the desired (1 + e)-approximation. 

A. 5 Proof of Theorem [10] 

We show that finding the optimal solution, either integer or fractional, to the SBO problem in the 
scenario model is NP-hard. 
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The reduction is from clique. We are given an instance of the clique problem with a graph 
G containing n nodes and m edges, and a desired clique size k. We use G and k to construct an 
instance / of SBO problem and a number V such that there is a solution to / with expected value 
of at least V if and only if G contains a clique of size k. 

To specify /, let us construct a new bipartite graph H = (LUR, E') whose right side R contains 
n nodes corresponding to nodes of G, and whose left side L contains m nodes corresponding to 
edges of G. A node in L corresponding to an edge (u, v) is connected to nodes in R that correspond 
to its endpoints u and v. We first describe the idea of the construction, and later show how to set 
the parameters to make it work. There will be three parameters, small positive values e and S, and 
a large value t. 

All nodes of H are keywords, expensive ones on the left, with cpc^ = 1 for i G L, and cheap 
ones on the right, with cpc^ = e for i G R. The budget is K = The goal will be to force a 
solution to select K nodes from L that are incident to at most k nodes in R, which corresponds 
to finding a set of k nodes with K edges in G, i.e. a A;-clique. The scenarios in the SBO problem 
are as follows. There is a high-probability scenario uo in which one click comes for each keyword 
in L. This scenario is sufficiently likely (occurs with probability 1 — 5) that any integer solution to 
SBO has to bid for at least K of these keywords. Notice that since K is the budget, it does not 
make sense to bid on any more than K keywords. In addition to <ro, there are n scenarios o~\ . . . o~ n , 
each occurring with probability 5/n. Scenario <7j contains K/e clicks for keyword i G R and a large 
number t of clicks for each of i's neighbors from L. 

We now explain the intuition for why there is a good integrally-bidding solution for our SBO 
instance if and only if the graph G contains a fe-clique. By the way we constructed the low- 
probability scenarios, if a solution does not bid on any neighbors of i G R, then in scenario <jj 
it would spend its whole budget on K/e cheap clicks at cost e each, thus obtaining many clicks. 
However, if it bids on any neighbors of i, then most of the budget will be spent on the expensive 
clicks from L, resulting in few clicks overall. So bidding on a keyword I from L effectively ruins the 
scenarios containing Z's neighbors in R. Recall that the high probability of scenario o"o forces any 
good integral solution to bid on exactly K keywords from L. As a result, if G contains a £>clique, 
then it is possible to select K keywords corresponding to edges of G that ruin only k scenarios 
corresponding to nodes of G. However, if there is no /c-clique, then bidding on any K keywords on 
the left ruins at least k + 1 scenarios, thus producing a solution with a lower value. 

We now show how to set the parameters of the construction and prove that the reduction works 
even if fractional bidding on keywords is allowed. First, assume that G contains a fc-clique. Then 
a solution b to /, with bi = 1 for all i G R and bi = 1 for the K keywords in L that correspond to 
edges of the clique, achieves the expected value of at least 

n e 

where the first term is the value from scenario a®, and the second term is the value from scenarios 
<7j such that node i in G is not in the clique. Such scenarios are unaffected by the selected nodes in 
L and therefore get K/e clicks each. There is additional value from scenarios u\ for i in the clique, 
but we disregard it for this lower bound. Thus we get the following claim. 

Claim 14 If G contains a k-clique, then there exists a solution b to I such that £'[value(6)] > V. 

To ensure that if there is no fc-clique in G, then value V cannot be achieved by any bids, we set 
the parameters as follows. 
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1. Select e such that < e < 

2. Select 5 > small enough that — > 0. This is possible because the limit of the 
expression on the left as 5 — > is \. 

3. Let a = ^. 

4. Choose i large enough that (fc+l) ^^"* < ^. This is possible because lim t _ >0O (fc+l) K ^at = 
fc + 1 < i by the choice of e. 

Claim 15 If i?[value(b)] > F /or some fractional solution b to the constructed instance I, then 
there must be at least K keywords i G L such that bi > a. 

Proof. Notice that J2i&L h > (K — 1 + ma) implies that \{i € L \ bi > a}\ > K, because 6j's are 
always at most 1. 

What remains to show is that if J2ieL ^ < — 1 + ma), then £7[value(6)] < V. This follows 
from the way we defined the parameters. Notice that 5^- is an upper bound on the value that any 
solution can obtain from scenarios o\ . . . a n . Then 

, m / w x SK , w 1. SK 1-5 k5K 

E[ va \ue(b)]<(l-5)(K-l + ma) + — = (l-5)(K--) + — = V - — + -— - < V, 

where the first equality comes from the definition of a, and the last inequality from the choice of 5. □ 



Claim 16 Let X = {i € L \ bi > a}. If H contains at least (k + 1) nodes in R that have neighbors 
in X, then £[value(6)] < V. 

Proof. For a node i G R, let a, = J2jeN t where Ni C L is the set of neighbors of i. Assuming 
that there are at least (fc + 1) nodes i € R such that a, > a, we show that £7[value(&)] < V. Notice 
that the value of a solution is always maximized by bidding on all keywords in R, because that 
maximizes the number of cheap clicks. So without loss of generality, we assume that bi = 1 for all 
i € R. 

In a given scenario o"j, b gets clicks^ = ^ + a it, where K/e clicks come from keyword i, and 
ait come from its neighbors in L. The cost paid in this scenario is cost CTi = K + ait, where K is 
spent for the cheap keywords and a^t is spent on keywords of cost 1. Using (1 — S)K as an upper 
bound on the value obtained from uq, and remembering that the budget is K, we have 



E[value(&)] < 



< 
< 



fN ^ 8 Clicks^ 



cost^ /K 

K/e + ait 



(1-S)K + — 

n 

V, 



(k + 1) 



K/e 



at 



K + at 



+ (n — k 



,.11 
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where we use the fact that the fraction in the sum increases with decreasing aj, bound on by a for 
(k + 1) of the terms and by for the others, and use the choice of t for the final inequality. □ 

Clearly, if there is no /c-clique in G, then every K edges in G will be incident on at least k + 1 
nodes. So from the preceding two claims, we may conclude that if G does not contain a fc-clique, 
then no solution to / has expected value of V or more. Together with Claim [T4l this proves that 
SBO problem in the scenario model is NP-hard. 



A.6 Proof of Theorem [TT] 

We give an example in which the ratio between the value of the optimal solution and the value of 
any prefix solution can be arbitrarily large. The example contains n scenarios and 2n keywords, 
numbered 1 through 2n. The cost per click of keywords increases exponentially, with cpCj = c*, 
for some constant c > 1. There is a budget B > 0. Say that the n scenarios are numbered a = 1 to 
n. In scenario a, only keywords 2a — 1 and 2a receive clicks, and they receive B/c 2a ~ 1 clicks each. 
The probabilities of scenarios increase exponentially, and they are equal to ac 2a ~ 1 for scenario a 
(a is chosen to make the probabilities sum to 1). The idea here is that in each scenario, there are 
two types of clicks, cheap and expensive (clicks for the even-numbered keywords are c times more 
expensive than for their preceding odd- numbered keywords), and there are enough cheap clicks to 
spend the whole budget. So for a particular scenario a, the best thing to do is to bid only on the 
cheap keyword 2a — 1, which gets B/c 2a ~ 1 clicks. Bidding on both keywords exceeds the budget 
and decreases the number of clicks to ^-(S/c 2ct_1 ). Since the sets of keywords that receive clicks 
in different scenarios are disjoint, the optimal solution overall (which happens to be integer) is to 
bid on all the odd-numbered keywords, but not the even-numbered ones. This gets the maximum 
number of clicks for each scenario individually, and therefore gets the maximum number of clicks 
in expectation. The expected number of clicks for the optimal solution is therefore 



ac 2a ~ x ■ —r — r = naB. 

c 2a-l 



Now consider some prefix solution for this example, either integer or fractional, and the keyword 
i* such that 6« = 1 for i < i* and bi = for i > i*. Let a* = \i*/2] be the scenario containing clicks 
for keyword i* . Intuitively, the prefix solution ruins the scenarios numbered less than a* because 
it bids for both keywords in them, and it ruins the scenarios numbered greater than a* because it 
does not bid at all for the keywords in them. As a result, a prefix solution can do well in at most 
one scenario. It gets the small number of clicks, ^i(B /c 2<T_1 ), for scenarios a < a*, and it gets 
clicks for scenarios a > a*. It may get up to B/c 2a _1 clicks for a*. So the value of a prefix 
solution is at most 

V ac 2 *- 1 ■ . 2 - , f = + ac 2a *- x ■ -^!— < naB ( — + - 
^ (c + l)^^ 1 c 2a 1 Vc+1 n 

which can be arbitrarily far from OPT = naB as c and n increase. 



A. 7 Extension to multiple-slot auctions 

We now sketch the extension of our results to the case when there are multiple slots, and in 
particular, we assume the Generalized Second Price (GSP) auction currently used by search-related 
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advertising engines. When advertising slots are allocated by a second-price auction with multiple 
slots, the bid amount for a keyword determines the position of the corresponding ads, which affects 
the number of clicks obtained for this keyword and the cost per click of these clicks. When a user 
clicks on the ad in slot i, the advertiser at slot i is charged the bid amount of the advertiser at slot 

i + iE 

Let us first focus on a particular keyword i and an auction in which it participates. If the auction 
has k available slots, then there are k threshold bid amounts, bid\ > bidi > ••• > bid)., such that 
bidding any amount in the interval [bidj,bidj-i) places the ad in slot j, which has a probability of a 
click (clickthrough rate) ctr] and a cost per click cpc^ . Since we are considering the GSP auction, 
the cost per click is not affected by the exact bid amount, as long as it is in the specified interval. 
Both ctr^ and cpc^ are monotone non-decreasing step functions of the bid amount. To better 
represent the options for bidding on keyword i, we visualize the k possible pairs of (cpcjctr^, ctrj) 
values on a "plot" called a landscape. Notice that when both the axes are scaled by the number of 
queries, then it becomes a plot of clicks.; vs. costj, with points for different options of how to bid 
(a more detailed description of landscapes appears in [9]). Landscapes for multiple auctions for the 
same keyword can be combined to obtain an aggregate landscape. 

Some of our results extend to the model with such aggregate landscapes. Roughly speaking, 
a keyword with a landscape can be viewed as a list of several simple individual keywords with 
an additional constraint that a solution has to bid on some prefix of this list. For the fixed 
and proportional models, the optimal solutions without landscapes are prefix solutions anyway (by 
Theorems [2] and [3D, so if we solve the problem as in the one-slot case, the solution will automatically 
satisfy the prefix constraint for keywords with landscapes, which means that it will also be optimal 
for the multiple-slot problem. In the independent model, however, the approximation ratio of 2 for 
the prefix solutions (Theorem [6|), that we prove for the one-slot case, does not extend to the case 
of landscapes. This is because some of the "keywords" are no longer independent, but are actually 
the different bidding options for the same keyword. In fact, a prefix solution can be arbitrarily bad 
compared to the optimal solution, by an example that is very similar to one in Theorem II 1[ The 
only difference is that, instead of keywords 2i — 1 and 2i being coupled by occurring in the same 
scenario, they are coupled by representing the landscape of the same keyword. The negative results 
(Theorems [TO] and [TTT) about the scenario model of course still hold for the more general case of 
multiple-slot auctions. 



There are some details, e.g. the cost is typically a small amount more than the bid of the advertiser at slot i + 1. 
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